OSMAN ― A Novel Arabic Readability Metric

نویسندگان

  • Mahmoud El-Haj
  • Paul Rayson
چکیده

We present OSMAN (Open Source Metric for Measuring Arabic Narratives) a novel open source Arabic readability metric and tool. It allows researchers to calculate readability for Arabic text with and without diacritics. OSMAN is a modified version of the conventional readability formulas such as Flesch and Fog. In our work we introduce a novel approach towards counting short, long and stress syllables in Arabic which is essential for judging readability of Arabic narratives. We also introduce an additional factor called “Faseeh” which considers aspects of script usually dropped in informal Arabic writing. To evaluate our methods we used Spearman’s correlation metric to compare text readability for 73,000 parallel sentences from English and Arabic UN documents. The Arabic sentences were written with the absence of diacritics and in order to count the number of syllables we added the diacritics in using an open source tool called Mishkal. The results show that OSMAN readability formula correlates well with the English ones making it a useful tool for researchers and educators working with Arabic text.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Abstracts/Journal of the Arabic Language and LiteratureVol.14, No48, autumn 2018

Contents The Representation of Culture in Arabic pedagogy books to non-Arabic languages Danesh Mohammadi, Sakineh Zarenejad....................................................... 1 Critical Study of the‏‏manifestations of Mamluke's life from the novel “Alsaeroun‏‏niyam...

متن کامل

A Novel Metric For Measuring The Readability of Software Source Code

Software metrics gives developers and their client and users of Software information about the quality of their Software products. They are employed in taking decisions at various levels of Software Life Cycle (SLC). This paper presents a novel metric for measuring the readability of Software Source Code (SC). Maintenance changes (addition, removal or modification), though, are initially carrie...

متن کامل

Optimizing Informativeness and Readability for Sentiment Summarization

We propose a novel algorithm for sentiment summarization that takes account of informativeness and readability, simultaneously. Our algorithm generates a summary by selecting and ordering sentences taken from multiple review texts according to two scores that represent the informativeness and readability of the sentence order. The informativeness score is defined by the number of sentiment expr...

متن کامل

AARI: automatic arabic readability index

Text readability refers to the ability of the reader to understand and comprehend a given text. In this research, we present our approach to develop an automatic readability index for the Arabic language: Automatic Arabic Readability Index (AARI), using factor analysis. Our results are based on more than 1196 Arabic texts extracted from the Jordanian curriculum in the subjects of: Arabic langua...

متن کامل

The Necessity for Reconsideration of Arabic Novel Translations A critique of Symphony of Destiny

 this paper is a review of the Persian translation of the popular Arabic novel, Kunshirtu al-Hulukust wa al-Nakbah (Destinies: The Concerto of The Holocaust and The Naqba), by the Palestinian Rabai al-Madhoun. This work received the International Award for Arabic Novels in 2016; and then was translated into Farsi under the title of the Symphony of Destiny. Unfortunately, the translation shows m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016